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FAST SYSTEM LEVEL RECOVERY 



BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates in general to computer-implemented database management 
5 systems, and, in particular, to a more efficient database recovery system. 

2. Description of Related Art 

Databases are computerized information storage and retrieval systems. A Relational 
Database Management System (RDBMS) is a database management system (DBMS) which uses 
relational techniques for storing and retrieving data. Relational databases are organized into 

10 tables which consist of rows and columns of data. The rows are formally called tuples. A 
database will typically have many tables and each table will typically have multiple tuples and 
multiple columns. The tables are typically stored on random access storage devices (RASD) 
such as magnetic or optical disk drives for semi-permanent storage. 

RDBMS software using a Structured Query Language (SQL) interface is well known in 

15 the art. The SQL interface has evolved into a standard language for RDBMS software and has 
been adopted as such by both the American National Standards Institute (ANSI) and the 
International Standards Organization (ISO). The SQL interface allows users to formulate 
relational operations on the tables either interactively, in batch files, or embedded in host 
languages, such as C and COBOL. SQL allows the user to manipulate the data. The definitions 

20 for SQL provide that a RDBMS should respond to a particular query with a particular set of data 
given a specified database content, but the technique that the RDBMS uses to actually find the 
required information in the tables on the disk drives is left up to the RDBMS. Typically, there 
will be more than one technique that can be used by the RDBMS to access the required data. 
The RDBMS will optimize the technique used to find the data requested in a query in order to 

25 minimize the computer time used and, therefore, the cost of performing the query. 
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Applications (i.e., application programs) communicate with relational databases using 
the Structured Query Language (SQL). SQL statements are issued to the RDBMS, which 
executes the SQL statements to access a relational database. The SQL statements received from 
an application are executed within a "unit of work", which is a collection of related SQL 
5 statements. Once the SQL statements are executed, they are either "committed" (i.e., the changes 
made by the executed SQL statements is made permanent in the relational database) or they are 
"rolled back" (i.e., the changes made by the executed SQL statements are removed, returning the 
relational database to the state it was in prior to execution of these statements). 

One important aspect of managing a RDBMS is creating backups of the objects to insure 

1 0 data integrity in the event of crashes or other computer misfortunes. Conventional systems often 
generate backups in a nondisruptive manner by allowing full read/write access to the data being 
concurrently copied. By making nondisruptive backups of all of the objects conventional 
systems necessarily create "fuzzy backups" (i.e., backups containing uncommitted data) since 
one or more objects may be altered while being processed by an application during the backup 

15 operation. In other words, the backup does not contain all of the current object information. 

In attempting to resolve this fuzzy backup problem, conventional systems perform a 
recovery procedure that retrieves objects or object data that was not initially recorded in the 
backup copy. Conventional systems, however, are not capable of selecting or determining which 
obj ects have been altered since the last backup operation. Therefore, in order to recover the most 

20 current objects, conventional systems resort to taking the system back to a point in time and 
recover all of the objects rather than only the objects that were actually altered. 

The conventional systems described above utilizes what is generically referred to as a 
two-phase commit system. This system is based on points of consistency. A point of 
consistency is a point in time at which the data in the database is consistent. The two-phase 

25 commit system ensures that either all or none of the databases are updated, and, thus, ensures that 
the databases remain synchronized. More specifically, a first point of consistency exists when 
a COMMIT statement is invoked. A new point of consistency is established when 
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the unit of work completes and the COMMIT statement has fully executed. The work completed 
or change implemented during this COMMIT interval is referred to as a unit of recovery (UR). 
Each UR represents, for example, an altered set of data or an altered object. The COMMIT 
statement ends a UR and commits the relational database changes that were made in that UR. 
5 While changes remain uncommitted, i.e., while an object or data is in the process of being 
altered, other application processes are unable to perceive these objects or data. Once 
committed, these database changes are accessible by other application processes. 

Since conventional systems perform the backup and recovery tasks by simply copying 
all of the objects as they existed at a certain time, some of the object or data changes copied to 

10 the backup are uncommitted, resulting in fuzzy backups. Subsequent recoveries cannot be in 
general accomplished by only restoring the backup copies. Thus, in order to ensure that an 
object contains current data, conventional systems copy all of the objects from the backup copy 
and scan and apply the database log, which records all of the activities relating to a database, to 
identify what changes were made to that object. Reviewing the database log for each object and 

1 5 applying the appropriate changes to a backup copy can be very time consuming and expensive 
resulting in inefficient database recovery procedures. 

In addition to copying all of the objects and applying the database log to these objects, 
an additional shortcoming of conventional systems is that they must backup objects at a 
"quiesce" point in time, i.e. the time when there are no uncommitted URs in the system. 

20 Establishing quiesce points is normally very disruptive for everyday operations. In order to 
create a quiesce point, the system suspends the creation of any new URs until all URs already 
started are committed. In the environments with long running URs, the attempts to create a 
quiesce point can bring the system to a standstill 

These shortcomings are further amplified when systems utilize multiple objects or large 

25 objects. For example, in environments with complex and practically unknown semantical 
integrity rules such as SAP® R/3®, Peoplesoft®, Baan®, etc., the only safe way to recover data or 
objects from a prior point in time is to bring the entire system, i.e. all the associated data and 
metadata back to that point in time. These systems consist of a large number of objects (tens of 
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thousands) and often a large amount of data (hundreds or thousands of GB). System recoveries 
based on the 'fuzzy' backups of these systems are even more expensive and time consuming. 
Consequently, users may experience longer system down time and data recovery, system 
disruption, and increased costs of performing database backups and recoveries. 
5 Accordingly, there is a need in the art for a framework for managing backup and 

recovery procedures in a database that identifies individual objects that have been altered, brings 
back the system to a particular time to recover those identified objects, and does so irrespective 
of the presence of uncommitted URs and quiesce points. That is, there is a need in the art for 
a more efficient object recovery system that can recover objects such that recovery times, 
10 database log analysis, system down time, and object recovery costs are minimized. 

SUMMARY OF THE INVENTION 
To overcome the limitations in the prior art described above, and to overcome other 
limitations that will become apparent upon reading and understanding the present 
specification, the present invention discloses a method, apparatus, and article of manufacture 
1 5 for an object-oriented framework for recovering data in a database. 

In accordance with the present invention, a database log which records database 
activities is scanned. One or more individual objects that are to be recovered to a target time 
with reference to a backup time are identified. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Referring now to the drawings in which like reference numbers represent corresponding 
parts throughout: 

FIG. 1 is a schematic illustrating a hardware environment in one embodiment of the 
present invention, and more particularly, illustrates a typical database system. 
25 FIG. 2 is a flow diagram illustrating processing performed by an obj ect recovery system 

in one embodiment. 
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FIG. 3 is a time chart illustrating in further detail a technique of identifying individual 
objects that are to be recovered by the object recovery system at a target time. 

FIG. 4A is a time chart illustrating a target time (t T ) occurring after a backup time (t B ). 

FIG. 4B is a time chart illustrating several scenarios involving CREATE, DROP, and 
5 NEW VOLUME operations. 

FIG. 4C is a time chart illustrating scenarios with REORG LOG(NO) and LOAD 
LOG(NO) operations. 

FIG. 4D is a time chart illustrating scenarios with checkpoints CHKPT-A, CHKPT-B, 
and CHKPT-C. 

10 FIG. 5 A is a time chart illustrating a target time (t T ) occurring before a backup time (t B ). 

FIG. 5B is a time chart illustrating several scenarios involving CREATE, DROP, and 
NEW VOLUME operations. 

FIG. 5C is a time chart illustrating scenarios with checkpoints CHKPT-A, CHKPT-B, 
and CHKPT-C. 

1 5 DETAILED DESCRIPTION 

In the following description of embodiments of the invention, reference is made to the 
accompanying drawings which form a part hereof, and which is shown by way of illustration 
specific embodiments in which the invention may be practiced. It is to be understood that other 
embodiments may be utilized as structural changes may be made without departing from the 

20 scope of the present invention. 

Hardware Architecture 
FIG. 1 is a diagram illustrating a computer hardware environment that could be used in 
accordance with the present invention. In the environment, a computer system 102 is comprised of 
one or more processors connected to one or more data stores 104 and 106 that store one or more 
25 relational databases in tablespaces. The data stores 104 and 106 may comprise a fixed or hard disk 
drive, a floppy disk drive, a CDROM drive, a tape drive, or other device. 
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Operators of the computer system 102 use a standard operator interface 108, such as 
IMS/DB/DC®, CICS®, TSO®, OS/390®, ODBC® or other similar interface, to transmit electrical 
signals to and from the computer system 102 that represent commands for performing various 
search and retrieval functions, termed queries, against the databases. In the present invention, 
5 these queries conform to the Structured Query Language (SQL) standard, and invoke functions 
performed by Relational DataBase Management System (RDBMS) software. 

The SQL interface has evolved into a standard language for RDBMS software and has 
been adopted as such by both the American National Standards Institute (ANSI) and the 
International Standards Organization (ISO). The SQL interface allows users to formulate 
10 relational operations on the tables either interactively, in batch files, or embedded in host 
languages, such as C and COBOL. SQL allows the user to manipulate the data. 

In an embodiment of the present invention, the RDBMS software comprises the DB2® 
product offered by IBM® for the OS/390® operating systems. Those skilled in the art will 
recognize, however, that the present invention has application to any RDBMS software, whether 
15 or not the RDBMS software uses SQL. 

As illustrated in FIG. 1, the DB2® system for the OS/390® operating system includes 
three maj or components: the Internal Resource Lock Manager (IRLM) 110, the Systems Services 
module 1 12, and the Database Services module 114. The IRLM 110 handles locking services 
for the DB2® system, which treats data as a shared resource, thereby allowing any number of 
20 users to access the same data simultaneously. Thus concurrency control is required to isolate 
users and to maintain data integrity. The Systems Services module 112 controls the overall 
DB2® system execution environment, including managing database log 106 data sets, gathering 
statistics, handling startup and shutdown, and providing management support. 

At the center of the DB2® system is the Database Services module 1 14. The Database 
25 Services module 114 contains several submodules, including the Relational DataBase 
Management System (RDBMS) or "database system" 116, the Data Manager 118, the Buffer 
Manager 120, and other components 122 such as an SQL compiler/interpreter. Additionally, an 
Object Recovery System 124 works in conjunction with the other modules. These submodules 
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support the functions of the SQL language, i.e. definition, access control, interpretation, 
compilation, database retrieval, and update of user and system data. 

The present invention is generally implemented using SQL statements executed under 
the control of the Database Services module 114. The Database Services module 1 14 retrieves 
5 or receives the SQL statements, wherein the SQL statements are generally stored in a text file 
on the data stores 104 and 106 or are interactively entered into the computer system 102 by an 
operator sitting at amonitor 126 via operator interface 108. The Database Services module 1 14 
then derives or synthesizes instructions from the SQL statements for execution by the computer 
system 102. 

10 Generally, the database software, the SQL statements, and the instructions derived 

therefrom, are all tangibly embodied in a computer-readable medium, e.g. one or more of the 
data stores 104 and 106. Moreover, the database software, the SQL statements, and the 
instructions derived therefrom, are all comprised of instructions which, when read and executed 
by the computer system 102, causes the computer system 102 to perform, implement and/or use 

15 the present invention. Under control of an operating system, the database software, the SQL 
statements, and the instructions derived therefrom, may be loaded from the data stores 104 and 
106 into a memory of the computer system 102 for use during actual operations. 

Thus, the present invention may be implemented as a method, apparatus, or article of 
manufacture using standard programming and/or engineering techniques to produce software, 

20 firmware, hardware, or any combination thereof. The term "article of manufacture" (or alternatively, 
"computer program product") as used herein is intended to encompass a computer program accessible 
from any computer-readable device, carrier, or media. Of course, those skilled in the art will 
recognize many modifications may be made to this configuration without departing from the scope 
of the present invention. 

25 Those skilled in the art will recognize that the environment illustrated in FIG. 1 is not 

intended to limit the present invention. Indeed, those skilled in the art will recognize that other 
alternative hardware environments may be used without departing from the scope of the present 
invention. 
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Fast System Level Recovery 

A. Overview 

One embodiment of the present invention provides an object recovery system 124 that 
executes in conjunction with a database system 116, for example, a RDBMS. The object 
5 recovery system 124 determines an optimal combination of data restorations and applications 
of the database log 106 to the contents of an object to recover a system to a target time. 

FIG. 2 is a flow diagram illustrating processing performed by the object recovery system 
124 in one embodiment. In particular, the object recovery system 124 identifies individual 
objects that must be recovered after a backup of a database. 
10 In block 200, at a current time, the object recovery system 124 may temporarily render 

the database inactive and flush any necessary data in cache memory to disk. 

In block 210, the object recovery system 124 creates a copy of the database log 106. A 
database log 106 is a record of the database activity which may be used for statistical, backup 
and/or recovery purposes. For example, a database log 106 may record the details of a 
1 5 transaction (i.e., a unit of work) or modifications to the database as a result of, for example, an 
INSERT or DELETE statement. 

In an alternative embodiment, the object recovery system 124 may proceed to block 220 
without making a copy of the database log 106. A copy of the database log 106 is created as a 
precautionary measure since some of the original database log 106 entries may eventually be 
20 permanently deleted. For example, if the database log 106 is truncated at an incorrect time, any 
database log 106 entries that were deleted may not be recoverable. Thus, making a copy of the 
database log 106 may ensure that the original database log may be retrieved if the selection of 
the truncation point is not made with desired confidence or accuracy. 

In block 220, the object recovery system 124 identifies one or more individual objects 
25 that are to be recovered and restored to a target time (i.e., a time before the current time to which 
the database is brought back), and optimizes the processing if desired. 

In block 230, the database log 106 is truncated at the target time (t T ) 340. A user or a 
system may decide where to truncate the database log 106. As a result, only part of the database 
log 1 06 exists, i.e., the truncated database log 1 06 includes data up to the selected target time (t T ) 
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340. In one embodiment, the object recovery system 124 may then begin a new database log 
record beginning from the target time (t T ) 340 for future database activities. 

In block 240, in order to prevent the database from accessing inconsistent objects, the 
obj ect recovery system 1 24 disables access to all of the database obj ects, including any metadata 
5 objects. The object recovery system 124 then restarts the database resulting in a "conditional" 
database. This type of restart will detect URs that were uncommitted at the time the database 
log 106 was truncated, i.e., at the target time (t T ) 340. 

In block 250, the object recovery system 124 creates compensation log records, if 
necessary, and appends (i.e., writes) these compensation log records to the truncated database 
10 log 106 to enable uncommitted changes that were made to the object data to be backed out. 

In block 260, the obj ect recovery system 1 24 restores the identified obj ects that may need 
to be recovered ( i.e., restores objects with uncommitted URs) and identifies objects that may be 
restored without applying the database log 1 06 to those objects. Then in block 270, the database 
log 106 is applied to restored objects, if necessary. In block 280, the object recovery system 
15 124 releases the previous hold on the objects and enables the database to access all of the 
objects. 

B. Quiesce at Current Time 

In this section, the object recover system 124 will quiesce at current time (i.e., the 
memory is flushed and there are no uncommitted URs in the system). 

20 With respect to the identification of objects to be recovered, the object recovery system 

124 scans the database log 1 06 from a checkpoint time (before the target time) to the current time 
and detects when URs have been completed. In other words, the object recovery system 124 
detects all of the URs that were existing in the system from the beginning of the last checkpoint 
time (before the target time) until the current time. 

25 FIG. 3 is a time chart illustrating example scenarios in which a unit of recovery (UR) 

begins and ends at different times relative to a checkpoint, a target time, and a current time. In 
particular, individual objects are identified based on different time variables, and, for each unit 
of recovery, the object recovery system 124 detects the beginning of aUR (b^ 300 and the end 
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of a UR (e UR ) 310. Each UR represents a unit of work consisting of a recoverable sequence of 
operations executed by a database system for an application process. In other words, each UR 
may represent an altered set of data or an altered object. In other words, a UR is defined by 
points such that either all or none of the objects of the database are updated. The COMMIT 
5 statement ends a UR and commits the database changes that were made in that UR. While 
changes remain uncommitted, other application processes are unable to perceive them. Once 
committed, these database changes are accessible by other application processes. 

The time at which a UR begins and ends determines whether the object associated with 
a UR has to be recovered, whether the recovery process can be optimized, and whether the 
10 database log 106 must be applied to those objects. 

The time at which a UR completes is analyzed within a time line based on a checkpoint 
time (t CHPT ) 320, a target time (t T ) 340, and a current time (t c ) 360. The checkpoint time (t CHPT ) 
320 and target time (t x ) 340 occur before the current time (t c ) 360. 

A checkpoint time (t CHPT ) 320 serves as an initial reference point. The object recovery 
15 system 124 begins to scan the database log 106 from checkpoint time (t CHPX ) 320 (before the 
target time (t T ) 340) to the current time. 

FIG. 3 only illustrates the beginning of a checkpoint as indicated by checkpoint time 
(t CHPT ) 320. However, there may also be an end of the checkpoint, and at this time, another 
database log record may be generated. Further, there may be multiple checkpoints, and thus, 
20 multiple checkpoint times, and these scenarios will be discussed further below. One 
embodiment of the present invention, as illustrated in FIG. 3, identifies the beginning of a 
checkpoint as indicated by checkpoint time (t CHPT ) 320 which occurs before a target time (t T ) 340. 

Target time (t T ) 340 is the time to which the object recovery system 124 brings back the 
database. The target time (t T ) 340 represents the time at which the database log 106 will 
25 eventually be truncated such that the truncated database log 106 can be utilized to determine 
which objects contain altered data. For example, assume a user was working with a database and 
saved a file on a Monday at 5:00p.m. Further assume that on the following Tuesday morning, 
the user continued to use the file, and then the user wants to restore the database to represent data 
existing as of Monday at 5:00pm, then the target time (t T ) 340 would be Monday, 5:00pm. The 
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target time (t T ) 340 may be selected or defined by a user, a database administrator, or other 
individual. In an alternative embodiment, the target time is system-defined; that is, a system 
(e.g., a system program) may be configured to select the target time (t T ) 340. 

Following is a detailed description of how objects to be recovered are identified based 
5 on scanning and analyzing the database log 106. The database log 106 is scanned from a 
checkpoint time (t CHPT ) 320 (before the target time (t T ) 340) through the current time (t c ) 360. 
Within this time frame, the object recovery system 124 analyzes the database log 106 to detect 
when one or more URs began and ended. Based on these times, it may or may not be necessary 
to recover the object. 

10 An object may not need to be recovered if an object was committed before the target time 

(t T ) 340. Further, since cached data is flushed at current time (t c ) 360, in these scenarios, both 
the target time (tj) 340 and the current time (t c ) 360 occur after any and all object changes have 
been written to disk since the UR has completed and been committed. Referring to FIG. 3 , Cases 
1, 2 and 4 represent scenarios in which it is not necessary to recover an object. 

1 5 More specifically, the database log 1 06 does not need to be applied to obj ects with URs 

corresponding to Cases 1 ? 2, and 4 since these objects are not even recovered. The objects that 
may have to proceed with the subsequent log apply phase include those objects with URs 
corresponding to Cases 3, 5, and 6. After the database log 106 has been applied to objects, then 
the object recovery system 124 proceeds to block 280. 



20 CASE1 

In Case 1, the object recovery system 124 begins to scan the database log 106 from 
checkpoint time (t CHPT ) 320. In this case, b m 300, the beginning of a UR, and e^ 310, the end 
of a UR, are both completed before the checkpoint time (t CHPT ) 320. Consequently, the object 
recovery system 124 does not detect an uncommitted UR since it detects neither b^ 300 nor e^ 

25 3 1 0. In other words, any changes that were implemented were present at both the target time (t T ) 
340 and the current time (t c ) 360. Thus, it is not necessary to recover objects associated with 
these URs since there is not a concern that an object or object data has changed. 
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CASE 2 

It is not necessary for the object recovery system 124 to recover objects associated with 
URs illustrated in Case 2. In this case, the object recovery system 124 begins a scan of the 
database log 106 from the checkpoint time (t CHPT ) 320. In doing so, the object recovery system 
5 124 does not detect 300, but it does detect e^ 3 10 prior to the target time (t T ) 340. Detecting 
e^ 310 prior to the target time (t T ) 340 signifies that the UR was completed and committed 
before target time (t T ) 340. Consequently, any changes that were implemented existed at both 
the target time (t T ) 340 and the current time (t c ) 360. Therefore, the object recovery system 124 
does not need to recover objects associated with these URs since the data is the same at target 
10 time (t T ) 340 as it is at current time (t c ) 360. 

CASE 3 

In Case 3, the object recovery system 124 begins the scan of the database log 106 at the 
checkpoint time (t CHPX ) 320. The object recovery system 124 does not detect b m 300, however, 
the object recovery system 124 does detect 3 1 0 after target time (t T ) 340. Thus, the UR was 

1 5 not completed and committed until after target time (t T ) 340. Consequently, taking the database 
back to target time (t T ) 340 would not reflect changes made to the object after target time (t T ) 
340. Thus, the object recovery system 124 identifies objects associated with these URs as 
objects that are to be recovered and restored. 

In particular, the obj ect recovery system 1 24 detects 3 1 0 after the target time (tj.) 340, 

20 i.e., after the database log 106 has been truncated. When 310 is identified, the object 
recovery system 124 may generate a compensation record with the appropriate DELETE and 
INSERT statements and append the compensation record to the truncated database log 1 06. As 
a result, the application of the truncated database log 106 in conjunction with the added log 
section results in accurate and current object data. 

25 

CASE 4 

Case 4 provides an additional example in which it is not necessary for the obj ect recovery 
system 124 to recover an object. As the object recovery system 124 scans the database log 1 06 
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from checkpoint time (t CHPT ) 320, the object recovery system 124 detects both b UR 300 and 
310, both of which occur before target time (t T ) 340. The UR is completed and committed 
before the target time (t T ) 340, and as a result, there are no changes to the object or object data 
between the target time (t T ) 340 and the current time (t c ) 360. Consequently, the object recovery 
5 system 124 may or may not recover objects associated with these URs. 

CASE 5 

In Case 5, the object recovery system 124 identifies objects to be recovered. In this case, 
a scan of the database log 106 starting from checkpoint time (t CHPT ) 320 detects b^ 300, which 
occurs before target time (t T ) 340. Continuing the database log 106 scan, the object recovery 
10 system 124 also detects e m 310 after target time (t T ) 340, which signifies that the UR was not 
committed before target time (t T ) 340. Thus, the object recovery system 124 identifies objects 
associated with these URs as objects to be recovered and restored. 

CASE 6 

In Case 6, the object recovery system 124 scans the database log 106 from checkpoint 
15 time (t CHPX ) 320 and detects b^ 300 and e^ 310 after target time (t T ) 340. Thus, the UR was not 
committed until after the target time (t T ) 340, and the object recovery system 124 identifies 
objects with these URs to be recovered and restored. 

C. Target Time After Backup Time 

20 FIG. 4 A is a time chart illustrating a target time (t x ) 410 occurring after a backup time 

(t B ) 400. Backup time (t B ) 400 represents the time at which a backup of the database was 
created. Target time (t T ) 410 represents the time to which the object recovery system 124 brings 
back the database. Current time (t c ) 420 may be a time at which the system is stopped or may 
be a point when the system crashed. 

25 When creating backups of a database , the object recovery system 124 may utilize 

different backup procedures, including for example, volume based backups, object-based 
backups or logical backups. It may be desirable for the object recovery system 124 to utilize 
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volume-based backup procedures if the backup is taken by 'snapping' the entire database. In 

other words, all of the data, metadata and system volumes (such as recovery logs, etc.) may be 

copied simultaneously. In volume-based backups, each object contains an indication of the most 

recent time when all the data (i.e., data blocks or pages) for that particular object resided in the 
5 underlying datasets on disks. In other words, there is an indication whether there is no data in 

the database cache that is more current than the data on disk. This indicator enables the object 

recovery system 124, to apply the database log 106, if necessary. 

Logical objects backups are made by the database itself or a tool that has access to the 

database cache. As a result, data that is cached in the database at the time of taking such a 
10 backup can be copied as well. This places the starting point of the database log 1 16 scan at the 

time when the backup began. 

The backup time (t B ) 400 represents a time at which a fuzzy backup of a recoverable 

database object, data set or volume was taken. Additionally, the backup time (t B ) 400 and the 

closest fuzzy backup times to the target time (t x ) 410. In other words, the backup at (t B ) 400 
15 represents a backup that contains uncommitted URs because, for example, an application was 

manipulating an object when the backups were generated. In an alternative embodiment, it is 

possible that backup time (t B ) 400 does not exist for each obj ect, and the obj ect must be recreated 

entirely from the database log 106. 

For the scenario in FIG. 4 A, the object recovery system 124 restores data, without 
20 restoring the log. As stated above, the current time (t c ) 420 may be a time at which the system 

is stopped or may be a point when the system crashed. 

When the system crashes, the object recovery system 124 may proceed to block 210 

without rendering the database temporarily inactive and/or without flushing cache data to disk. 

For example, the current time (t c ) 420 may represent the time when a user is working with a 
25 database and the database crashes. In this instance, the current time (t c ) 420 represents a time 

when a user experiences a data integrity problem and needs to recover the database contents to 

a target time (t T ) 410. 

In this case, a database Restart operation can not be used to recover the data to the 
recovery point because the data and log are not consistent. Therefore, the object recovery 
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system 124 uses a Conditional Restart with DEFER ALL. This brings all DB2 R members down; 
deletes certain structures, including LOCK structures; restores DB2® data from the backup taken 
at the backup time (t B ) 400;creates a restart record; starts DB2® with the DEFER ALL option, 
which writes logs to backout uncommitted changes; and, resolves in-doubt URs. The object 
5 recovery system 124 uses LOGONLY or Image Copy Recovery for all tablespaces, by 
recovering directory/catalog objects first and following the same order as for disaster recovery. 
The object recovery system 124 also rebuilds all indexes using, for example, LOGONLY 
Recovery or Image Copy Recovery for indexes. 

FIG. 4B is a time chart illustrating several scenarios involving CREATE, DROP, and 

1 0 NEW VOLUME operations. That is, there may be obj ects created between the backup time (t B ) 
400 and target time (t T ) 410 or between the target time (t T ) 410 and current time (t c ) 420. To 
handle objects that were created between the backup time (t B ) 400 and target time (t T ) 410, the 
object recovery system 124 uses an Image Copy Recovery or Rebuild Index. To handle objects 
that were dropped between the backup time (t B ) 400 and target time (t T ) 410, the object recovery 

1 5 system 124 deletes the orphan datasets. Additionally, the object recovery system 124 reformats 
volumes added after the target time (tj.) 410. A trace is used to monitor some operations, such 
as CREATE, DROP, or EXTEND. 

FIG. 4C is a time chart illustrating scenarios with REORG LOG(NO) and LOAD 
LOG(NO) operations. For objects accessed by REORG LOG(NO) and LOAD LOG(NO), the 

20 obj ect recovery system 1 24 uses an Image Copy Recovery. Also, the obj ect recovery system 1 24 
will determine LOG(NO) events prior to restoring the database catalog and directory. Note that 
LOG(NO) objects are objects whose associated transactions were not logged. 

FIG. 4D is a time chart illustrating scenarios with checkpoints CHKPT-A, CHKPT-B, 
and CHKPT-C . The obj ect recovery system 1 24 identifies obj ects that need to be recovered by 

25 scanning database recovery logs to identify write pending objects between CHKPT-A and the 
backup time (t B ) 400. Then, the object recovery system 124 identifies all UR-modified objects 
between the backup time (t B ) 400 and target time (t x ) 410. Alternatively, the object recovery 
system 124 may look for entries in one or more DBMS catalog tables to identify object that are 
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changed across or beyond the backup time (t B ) 400. Then, the object recovery system 124 
recovers the DBMS catalog tables and directory objects. 

D. Target Time Before Backup 
5 FIG. 5 A is a time chart illustrating a target time (t T ) 510 occurring before a backup time 

(t B ) 500. Backup time (t B ) 500 represents the time at which a backup of the database was 
created. Target time (tj) 5 1 0 represents the time to which the obj ect recovery system 1 24 brings 
back the database. Current time (t c ) 520 may be a time at which the system is stopped and is 
quiesced or may be a point when the system crashed. 

10 For the scenario in FIG. 5 A, the object recovery system 124 restores data, and, 

optionally, restores the log. In this case, the object recovery system 124 uses a Conditional 
Restart with DEFER ALL. DEFER ALL indicates that when a system is restarted, access to the 
database is not available. Conditional Restart with DEFER ALL brings all DB2® members 
down; deletes certain structures, including LOCK structures; restores DB2® data and log from 

15 the backup taken at the backup time (t B ) 500; creates a restart record; starts DB2® with the 
DEFER ALL option, which writes logs to backout uncommitted changes; and, resolves in-doubt 
URs. The object recovery system 124 uses Image Copy Recovery for all tablespaces, by 
recovering directory/catalog objects first. The object recovery system also rebuilds all indexes 
using, for example, Image Copy Recovery for indexes. 

20 FIG. 5B time chart illustrating several scenarios involving CREATE, DROP, and NEW 

VOLUME operations. That is, there may be obj ects created between the target time (tp) 5 1 0 and 
the backup time (t B ) 500 or between the backup time (t B ) 500 and current time (t c ) 520. For 
objects that were created between the target time (tj) 5 10 and the backup time (t B ) 500, note that 
the objects will not exist after the DBMS catalog tables are recovered, and the object recovery 

25 system 124 deletes orphan datasets. To handle objects that were dropped after target time (t T ) 
510, the objects will reappear after the catalog is recovered. Additionally, the object recovery 
system 124 reformats volumes added after the backup time (t B ) 500. A trace is used to monitor 
some operations, such as CREATE, DROP, or EXTEND. 
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FIG. 5C time chart illustrating scenarios with checkpoints CHKPT-A, CHKPT-B, and 
CHKPT-C. The object recovery system 124 identifies objects that need to be recovered by 
scanning database recovery logs to identify write pending and UR-modified objects between 
CHKPT-A and the backup time (t B ) 500. Alternatively, the obj ect recovery system 1 24 may look 
5 for entries in one or more DBMS catalog tables to identify object that are across or beyond the 
target time (t T ) 500 before restoring the system using the log suspend copy. 

E. Further Optimizations 

Objects may be optimized to provide more efficient processing (block 220). More 
specifically, objects may be grouped in a more efficient manner such that objects for which 

10 backups reside on the same volumes have the same optimization level. More specifically, the 
object recovery system 124 determines whether it is more efficient to restore and recover only 
identified individual objects or restore entire volumes and recover corresponding objects. In 
making this determination, the object recovery system 124 may consider several different 
factors. For example, the object recovery system 124 may consider whether most of the objects 

15 that need to be recovered reside on the same volume. Restoring the volume and applying the 
database logs 106 to the objects in this instance may be more efficient than restoring individual 
objects. However, the amount of log data that needs to be scanned is generally larger if a 
volume-based recovery is instituted initially. 

20 Trademarks 

IBM, DB2, CICS, OS/390, ODBC, and IMS/DB/DC are trademarks or registered 
trademarks of International Business Machines, Corporation in the United States and/or other 
countries. 

SAP and R/3 are trademarks or registered trademarks of SAP Aktiengesellschaft in the 
25 United States and/or other countries. 

Peoplesoft is a trademark or registered trademark of PeopleSoft, Inc. in the United States 
and/or other countries. 
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Conclusion 

This concludes the description of some embodiments of the invention. The following 
describes some alternative embodiments for accomplishing the present invention. For 
example, any type of computer, such as a mainframe, minicomputer, or personal computer, or 
5 computer configuration, such as a timesharing mainframe, local area network, or standalone 
personal computer, could be used with the present invention. 

The foregoing description of the embodiments of the invention have been presented 
for the purposes of illustration and description. They are not intended to be exhaustive or to 
limit the invention to the precise form disclosed. Many modifications and variations are 
10 possible in light of the above teaching. It is intended that the scope of the invention be 
limited not by this detailed description, but rather by the claims appended hereto. 
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WHAT IS CLAIMED IS: 



1 LA method of recovering data in a database of a database system stored in a 

2 datastore connected to a computer, the method comprising: 

3 scanning a database log, wherein the database log records activities related to the 

4 database; and 

5 identifying one or more individual objects to be recovered to a target time with 

6 reference to a backup time. 

1 2. The method of claim 1, wherein the target time is user-defined. 

1 3 . The method of claim 1 , wherein the backup time is user-defined. 

1 4. The method of claim 1, further comprising analyzing the database log to 

2 detect when a unit of recovery begins and when the unit of recovery ends. 

1 5. The method of claim 4, wherein an object is not recovered when the unit of 

2 recovery accessing that object ends before the target time and there are no pending writes for 

3 the object. 

1 6. The method of claim 5, wherein the unit of recovery begins and ends before a 

2 checkpoint time, and wherein the checkpoint time occurs before the target time. 

1 7. The method of claim 5, wherein the unit of recovery begins before a 

2 checkpoint time, and wherein the unit of recovery ends after the checkpoint time but before 

3 the target time. 

1 8. The method of claim 5, wherein the unit of recovery begins after a checkpoint 

2 time, and wherein the unit of recovery ends before the target time. 
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1 9. The method of claim 4, wherein an object is recovered if the unit of recovery 

2 begins before the target time, and wherein the unit of recovery ends after the target time but 

3 before a current time, wherein the current time represents when object data is recorded to the 

4 database. 

1 1 0. The method of claim 9, wherein the unit of recovery begins before a 

2 checkpoint time. 

1 11. The method of claim 9, wherein the unit of recovery begins after a checkpoint 

2 time but before the target time. 

1 12. The method of claim 4, wherein an object is recovered if the unit of recovery 

2 begins after the target time, and wherein the unit of recovery ends before a current time, 

3 wherein the current time represents when object data is recorded to the database. 

1 13. The method of claim 1, wherein a backup is taken. 

1 14. The method of claim 13, wherein the backup occurs prior to the target time 

2 and further comprising restoring data without restoring the database log. 

1 15. The method of claim 13, wherein the backup occurs after the target time and 

2 further comprising restoring data and optionally restoring the database log. 

1 16. The method of claim 13, further comprising restarting the database system 

2 with a conditional restart with defer all option. 

1 17. The method of claim 1, further comprising flushing cache data to disk. 
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1 18. The method of claim 1, further comprising truncating the database log at the 

2 target time. 



1 19. The method of claim 18, further comprising disabling access to the database 

2 and restarting the database system, wherein restarting detects uncommitted units of recovery. 

1 20. The method of claim 19, further comprising creating a compensation log and 

2 appending the compensation log to the truncated database log beginning from the target time. 

1 21. The method of claim 1 , further comprising restoring the identified objects. 

1 22. The method of claim 21 , further comprising determining whether the database 

2 log should be applied to the restored objects to update the identified objects with current 

3 object data. 



1 23. The method of claim 22, when the determination is made to apply the 

2 database log to the identified objects, further comprising optimizing the identified objects 

3 such that the identified objects may be restored without applying the database log to the 

4 identified objects. 



1 24. The method of claim 23, if the objects can not be optimized, applying the 

2 database log to the restored objects. 

1 25. The method of claim 21, after restoring the identified objects, further 

2 comprising providing access to the identified objects. 

1 26. The method of claim 1 , further comprising optimizing the identified objects 

2 by restoring a volume of the datastore and recovering corresponding objects. 
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27. The method of claim 1, further comprising optimizing the identified objects 
by grouping the identified objects, wherein the grouped objects have backups residing on the 
same volume of the datastore. 



1 28. The method of claim 1, wherein an object is associated with different units of 

2 recovery, wherein one or more units of recovery require different levels of processing, and 

3 wherein the object is recovered utilizing the highest level of processing. 

1 29. The method of claim 1, wherein the one or more individual objects to be 

2 recovered to a target time are recovered from a current time. 

1 30. The method of claim 29, wherein the current time represents at time at which 

2 the database system crashed. 

1 31. An apparatus for recovering data in a database of a database system, 

2 comprising: 

3 a computer having a data store connected thereto, wherein the data store stores data; 

4 and 

5 one or more computer programs, performed by the computer, for scanning a database 

6 log, wherein the database log records activities related to the database and for identifying one 

7 or more individual objects to be recovered to a target time with reference to a backup time. 

1 32. The apparatus of claim 3 1 , wherein the target time is user-defined. 

1 33 . The apparatus of claim 3 1 , wherein the backup time is user-defined. 

1 34. The apparatus of claim 31, further comprising analyzing the database log to 

2 detect when a unit of recovery begins and when the unit of recovery ends. 
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1 35. The apparatus of claim 34, wherein an object is not recovered when the unit of 

2 recovery accessing that object ends before the target time and there are no pending writes for 

3 the object. 

1 36. The apparatus of claim 35, wherein the unit of recovery begins and ends 

2 before a checkpoint time, and wherein the checkpoint time occurs before the target time. 

1 37. The apparatus of claim 35, wherein the unit of recovery begins before a 

2 checkpoint time, and wherein the unit of recovery ends after the checkpoint time but before 

3 the target time. 

1 38. The apparatus of claim 35, wherein the unit of recovery begins after a 

2 checkpoint time, and wherein the unit of recovery ends before the target time. 



1 39. The apparatus of claim 34, wherein an object is recovered if the unit of 

2 recovery begins before the target time, and wherein the unit of recovery ends after the target 

3 time but before a current time, wherein the current time represents when object data is 

4 recorded to the database. 



1 40. The apparatus of claim 39, wherein the unit of recovery begins before a 

2 checkpoint time. 

1 41 . The apparatus of claim 39, wherein the unit of recovery begins after a 

2 checkpoint time but before the target time. 

1 42. The apparatus of claim 34, wherein an object is recovered if the unit of 

2 recovery begins after the target time, and wherein the unit of recovery ends before a current 

3 time, wherein the current time represents when object data is recorded to the database. 
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1 43 . The apparatus of claim 3 1 , wherein a backup is taken. 

1 44. The apparatus of claim 43, wherein the backup occurs prior to the target time 

2 and further comprising restoring data without restoring the database log. 

1 45. The apparatus of claim 43, wherein the backup occurs after the target time and 

2 further comprising restoring data and optionally restoring the database log. 

1 46. The apparatus of claim 43, further comprising restarting the database system 

2 with a conditional restart with defer all option. 

1 47. The apparatus of claim 31, further comprising flushing cache data to disk. 

1 48. The apparatus of claim 31, further comprising truncating the database log at 

2 the target time. 

1 49. The apparatus of claim 48, further comprising disabling access to the database 

2 and restarting the database system, wherein restarting detects uncommitted units of recovery. 

1 50. The apparatus of claim 49, further comprising creating a compensation log 

2 and appending the compensation log to the truncated database log beginning from the target 

3 time. 

1 51. The apparatus of claim 3 1 , further comprising restoring the identified objects. 

1 52. The apparatus of claim 5 1 , further comprising determining whether the 

2 database log should be applied to the restored objects to update the identified objects with 

3 current object data. 



ODMA\PCDOCS\DOCS\23387\8 
IBMST 044762 



24 



STL000O48US1 



1 53. The apparatus of claim 52, when the determination is made to apply the 

2 database log to the identified objects, further comprising optimizing the identified objects 

3 such that the identified objects may be restored without applying the database log to the 

4 identified objects. 

1 54. The apparatus of claim 53, if the objects can not be optimized, applying the 

2 database log to the restored objects. 

1 55. The apparatus of claim 51, after restoring the identified objects, further 

2 comprising providing access to the identified objects. 

1 56. The apparatus of claim 31, further comprising optimizing the identified 

2 objects by restoring a volume of the datastore and recovering corresponding objects. 

1 57. The apparatus of claim 31, further comprising optimizing the identified 

2 objects by grouping the identified objects, wherein the grouped objects have backups residing 

3 on the same volume of the datastore. 

1 58. The apparatus of claim 31, wherein an object is associated with different units 

2 of recovery, wherein one or more units of recovery require different levels of processing, and 

3 wherein the object is recovered utilizing the highest level of processing. 

1 59. The apparatus of claim 31, wherein the one or more individual objects to be 

2 recovered to a target time are recovered from a current time. 

1 60. The apparatus of claim 59, wherein the current time represents at time at 

2 which the database system crashed. 

1 61 . An article of manufacture comprising a computer program carrier readable by 
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2 a computer and embodying one or more instructions executable by the computer for 

3 recovering data in a database of a database system, comprising: 

4 scanning a database log, wherein the database log records activities related to the 

5 database; and 

6 identifying one or more individual objects to be recovered to a target time with 

7 reference to a backup time. 

1 62. The article of manufacture of claim 61, wherein the target time is user- 

2 defined. 

1 63. The article of manufacture of claim 61, wherein the backup time is user- 

2 defined. 

1 64. The article of manufacture of claim 61, further comprising analyzing the 

2 database log to detect when a unit of recovery begins and when the unit of recovery ends. 

1 65. The article of manufacture of claim 64, wherein an object is not recovered 

2 when the unit of recovery accessing that object ends before the target time and there are no 

3 pending writes for the object. 

1 66. The article of manufacture of claim 65, wherein the unit of recovery begins 

2 and ends before a checkpoint time, and wherein the checkpoint time occurs before the target 

3 time. 

1 67. The article of manufacture of claim 65, wherein the unit of recovery begins 

2 before a checkpoint time, and wherein the unit of recovery ends after the checkpoint time but 

3 before the target time. 
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1 68. The article of manufacture of claim 65, wherein the unit of recovery begins 

2 after a checkpoint time, and wherein the unit of recovery ends before the target time. 

1 69. The article of manufacture of claim 64, wherein an object is recovered if the 

2 unit of recovery begins before the target time, and wherein the unit of recovery ends after the 

3 target time but before a current time, wherein the current time represents when object data is 

4 recorded to the database. 

1 70. The article of manufacture of claim 69, wherein the unit of recovery begins 

2 before a checkpoint time. 

1 71 . The article of manufacture of claim 69, wherein the unit of recovery begins 

2 after a checkpoint time but before the target time. 

1 72. The article of manufacture of claim 64, wherein an object is recovered if the 

2 unit of recovery begins after the target time, and wherein the unit of recovery ends before a 

3 current time, wherein the current time represents when object data is recorded to the 

4 database. 

1 73. The article of manufacture of claim 61, wherein a backup is taken. 

1 74. The article of manufacture of claim 73, wherein the backup occurs prior to the 

2 target time and further comprising restoring data without restoring the database log. 

1 75. The article of manufacture of claim 73, wherein the backup occurs after the 

2 target time and further comprising restoring data and optionally restoring the database log. 

1 76. The article of manufacture of claim 73, further comprising restarting the 

2 database system with a conditional restart with defer all option. 
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1 77. The article of manufacture of claim 61, further comprising flushing cache data 

2 to disk. 

1 78. The article of manufacture of claim 61 , further comprising truncating the 

2 database log at the target time. 

1 79. The article of manufacture of claim 78, further comprising disabling access to 

2 the database and restarting the database system, wherein restarting detects uncommitted units 

3 of recovery. 

1 80. The article of manufacture of claim 79, further comprising creating a 

2 compensation log and appending the compensation log to the truncated database log 

3 beginning from the target time. 

1 81. The article of manufacture of claim 61, further comprising restoring the 

2 identified objects. 

1 82. The article of manufacture of claim 81, further comprising determining 

2 whether the database log should be applied to the restored objects to update the identified 

3 objects with current object data. 

1 83. The article of manufacture of claim 82, when the determination is made to 

2 apply the database log to the identified objects, further comprising optimizing the identified 

3 objects such that the identified objects may be restored without applying the database log to 

4 the identified objects. 

1 84. The article of manufacture of claim 83, if the objects can not be optimized, 

2 applying the database log to the restored objects. 



ODMA\PCDOCS\DOCS\233 87\S 
IBMST 044762 



28 



STL000048US1 



1 85. The article of manufacture of claim 81, after restoring the identified objects, 

2 further comprising providing access to the identified objects. 



1 86. The article of manufacture of claim 61, further comprising optimizing the 

2 identified objects by restoring a volume of the datastore and recovering corresponding 

3 objects. 

1 87. The article of manufacture of claim 6 1 , further comprising optimizing the 

2 identified objects by grouping the identified objects, wherein the grouped objects have 

3 backups residing on the same volume of the datastore. 



1 88. The article of manufacture of claim 61, wherein an object is associated with 

2 different units of recovery, wherein one or more units of recovery require different levels of 

3 processing, and wherein the object is recovered utilizing the highest level of processing. 

1 89. The article of manufacture of claim 61, wherein the one or more individual 

2 objects to be recovered to a target time are recovered from a current time. 



1 90. The article of manufacture of claim 79, wherein the current time represents at 

2 time at which the database system crashed. 
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FAST SYSTEM LEVEL RECOVERY 



ABSTRACT OF THE DISCLOSURE 
When recovering data in a database, initially, a database log, which records activities 
related to a database, is scanned. One or more individual objects that are to be recovered to a 
target time with reference to a backup time are identified. 
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